Scalable Load Balancing in Networked Systems: A Survey of Recent Advances

نویسندگان

چکیده

In this survey we provide an overview of recent advances on scalable load balancing schemes which favorable delay performance and yet require minimal implementation overhead. The basic scenario involves a single dispatcher where tasks arrive that must immediately be forwarded to one $N$ single-server queues. join-the-shortest-queue (JSQ) policy yields vanishing delays as grows large, in centralized queuing arrangement, but prohibitive communication burden. contrast, JSQ($d$) assign incoming task server with the shortest queue among $d$ servers selected uniformly at random little communication, lead constant delays. order examine fundamental trade-off between overhead, discuss body research JSQ($d(N)$) diversity parameter $d(N)$ depends investigate growth rate required match optimal JSQ fluid diffusion scales. Stochastic coupling techniques scaling limits play instrumental role establishing asymptotic optimality. We demonstrate how methodology carries over infinite-server settings, finite buffers, multiple dispatchers, arranged graph topologies, token-based such join-the-idle-queue (JIQ), thus providing broad main trends field.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Load Balancing Approaches for Web Servers: A Survey of Recent Trends

Numerous works has been done for load balancing of web servers in grid environment. Reason behinds popularity of grid environment is to allow accessing distributed resources which are located at remote locations. For effective utilization, load must be balanced among all resources. Importance of load balancing is discussed by distinguishing the system between without load balancing and with loa...

متن کامل

Scalable Load Balancing in Cluster Storage Systems

Enterprise and cloud data centers are comprised of tens of thousands of servers providing petabytes of storage to a large number of users and applications. At such a scale, these storage systems face two key challenges: (a) hot-spots due to the dynamic popularity of stored objects and (b) high reconfiguration costs of data migration due to bandwidth oversubscription in the data center network. ...

متن کامل

Scalable Load Balancing in Networked Systems: Universality Properties and Stochastic Coupling Methods

We present an overview of scalable load balancing algorithms which provide favorable delay performance in large-scale systems, and yet only require minimal implementation overhead. Aimed at a broad audience, the paper starts with an introduction to the basic load balancing scenario – referred to as the supermarket model – consisting of a single dispatcher where tasks arrive that must immediatel...

متن کامل

Scalable Load-Distance Balancing

We introduce the problem of load-distance balancing in assigning users of a delay-sensitive networked application to servers. We model the service delay experienced by a user as a sum of a network-incurred delay, which depends on its network distance from the server, and a server-incurred delay, stemming from the load on the server. The problem is to minimize the maximum service delay among all...

متن کامل

Scalable Load Balancing

In this paper, we develop load balancing strategies for scalable high-performance parallel A* algorithms suitable for distributed-memory machines. In parallel A* search, ineeciencies such as processor starvation and search of nonessential spaces (search spaces not explored by the sequential algorithm) grow with the number of processors P used, thus restricting its scalability. To alleviate this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Siam Review

سال: 2022

ISSN: ['1095-7200', '0036-1445']

DOI: https://doi.org/10.1137/20m1323746